
add Concat quantization #17448

Merged
luotao1 merged 3 commits into PaddlePaddle:develop from sfraczek:concat-quantization
May 27, 2019

Conversation


@sfraczek commented May 16, 2019

In files graph_pattern_detector.cc, graph_pattern_detector.h, cpu_quantize_pass.cc, cpu_quantize_pass.h, and mkldnn_quantizer.cc:

  • added Concat quantization code

In file mkldnn_quantizer.cc:

  • handled multiple tensors wired to a single input
  • extended the list of ops that do not modify the sign of the values in a tensor
  • set the type to unsigned after a regular ReLU op

@sfraczek
Author

I had to add the use_quantizer flag to the concat operator because it was missing in this PR.

wojtuss
wojtuss previously approved these changes May 23, 2019
Sylwester Fraczek added 3 commits May 24, 2019 11:53

  • add unit test for quantizing concat
  • fix for wrong value when the input is not in map of calculated scales
  • add use_quantizer to concat_op.cc
  • add scale_algo rules for concat

test=develop
@luotao1 luotao1 merged commit 96845d2 into PaddlePaddle:develop May 27, 2019
@luotao1
Contributor

luotao1 commented May 27, 2019

Which model uses this quantization? Do you have benchmarks from before and after this quantization?

@sfraczek
Author

GoogleNet and MobileNet-SSD benefit from this (but we do not have a test for MobileNet-SSD yet).
GoogleNet is about 1.67x faster with concat quantized than before (measured on my development i9 with batch size 50 over 1000 iterations).

@wojtuss wojtuss added this to the v1.5 for Intel milestone May 28, 2019